SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: UNIVERSITE CATHOLIQUE DE LOUVAIN 

Halles Universitaires 

(B) STREET: Place de l'Universite, 1

(C) CITY: LOUVAIN-LA-NEUVE 

(E) COUNTRY: BELGIUM 

(F) POSTAL CODE (ZIP) : B-1348 

(A) NAME: UNIVERSITE DE MONS-HAINAUT 

(B) STREET: Place du Parc 20

(C) CITY: MONS 

(E) COUNTRY: BELGIUM 

(F) POSTAL CODE (ZIP) : B-7000 

(ii) TITLE OF INVENTION: PEROXISOME-ASSOCIATED PEPTIDE, NUCLEOTIDE
SEQUENCE ENCODING SAID PEPTIDE AND THEIR USES IN THE
DIAGNOSTIC AND/OR THE TREATMENT OF LUNG INJURIES AND
DISEASES, AND OF OXIDATIVE STRESS-RELATED DISORDERS

(iii) NUMBER OF SEQUENCES: 19

(iv) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 193..681





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

GCCAGGAGGC GGAGTGGAAG TGGCCGTGGG GCGGGTATGG GACTAGCTGG CGTGTGCGCC 60

CTGAGACGCT CAGCGGGCTA TATACTCGTC GGTGGGGCCG GCGGTCAGTC TGCGGCAGCG 120

GCAGCAAGAC GGTGCAGTGA AGGAGAGTGG GCGTCTGGCG GGGTCCGCAG TTTCAGCAGA 180

GCCGCTGCAG CC ATG GCC CCA ATC AAG GTG GGA GAT GCC ATC CCA GCA 228 
Met Ala Pro Ile Lys Val Gly Asp Ala Ile Pro Ala
1 5 10 

GTG GAG GTG TTT GAA GGG GAG CCA GGG AAC AAG GTG AAC CTG GCA GAG 276 
Val Glu Val Phe Glu Gly Glu Pro Gly Asn Lys Val Asn Leu Ala Glu 
15 20 25 

CTG TTC AAG GGC AAG AAG GGT GTG CTG TTT GGA GTT CCT GGG GCC TTC 324 
Leu Phe Lys Gly Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe 
30 35 40 

ACC CCT GGA TGT TCC AAG ACA CAC CTG CCA GGG TTT GTG GAG CAG GCT 372 
Thr Pro Gly Cys Ser Lys Thr His Leu Pro Gly Phe Val Glu Gln Ala
45 50 55 60 

GAG GCT CTG AAG GCC AAG GGA GTC CAG GTG GTG GCC TGT CTG AGT GTT 420 
Glu Ala Leu Lys Ala Lys Gly Val Gln Val Val Ala Cys Leu Ser Val
65 70 75

AAT GAT GCC TTT GTG ACT GGC GAG TGG GGC CGA GCC CAC AAG GCG GAA 468
Asn Asp Ala Phe Val Thr Gly Glu Trp Gly Arg Ala His Lys Ala Glu 
80 85 90 

GGC AAG GTT CGG CTC CTG GCT GAT CCC ACT GGG GCC TTT GGG AAG GAG 516 
Gly Lys Val Arg Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Glu 
95 100 105 

ACA GAC TTA TTA CTA GAT GAT TCG CTG GTG TCC ATC TTT GGG AAT CGA 564 
Thr Asp Leu Leu Leu Asp Asp Ser Leu Val Ser Ile Phe Gly Asn Arg
110 115 120 

CGT CTC AAG AGG TTC TCC ATG GTG GTA CAG GAT GGC ATA GTG AAG GCC 612 
Arg Leu Lys Arg Phe Ser Met Val Val Gln Asp Gly Ile Val Lys Ala
125 130 135 140

CTG AAT GTG GAA CCA GAT GGC ACA GGC CTC ACC TGC AGC CTG GCA CCC 660 
Leu Asn Val Glu Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro 
145 150 155

AAT ATC ATC TCA CAG CTC TGA GGCCCTGGGC CAGATTACTT CCTCCACCCC 711 
Asn Ile Ile Ser Gln Leu *
160 

TCCCTATCTC ACCTGCCCAG CCCTGTGCTG GGGCCCTGCA ATTGGAATGT TGGCCAGATT 771 
TCTGCAATAA ACACTTGTGG TTTGCGGAAA AAAA 805 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Ala Pro Ile Lys Val Gly Asp Ala Ile Pro Ala Val Glu Val Phe
1 5 10 15

Glu Gly Glu Pro Gly Asn Lys Val Asn Leu Ala Glu Leu Phe Lys Gly 
20 25 30 

Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe Thr Pro Gly Cys
35 40 45

Ser Lys Thr His Leu Pro Gly Phe Val Glu Gln Ala Glu Ala Leu Lys
50 55 60 

Ala Lys Gly Val Gln Val Val Ala Cys Leu Ser Val Asn Asp Ala Phe
65 70 75 80 

Val Thr Gly Glu Trp Gly Arg Ala His Lys Ala Glu Gly Lys Val Arg 
85 90 95 

Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Glu Thr Asp Leu Leu 
100 105 110 

Leu Asp Asp Ser Leu Val Ser Ile Phe Gly Asn Arg Arg Leu Lys Arg
115 120 125 

Phe Ser Met Val Val Gln Asp Gly Ile Val Lys Ala Leu Asn Val Glu
130 135 140 

Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro Asn Ile Ile Ser
145 150 155 160 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus Rattus 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 136..624

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TGCGTCCTAG GCAGCATAGC CGGATCGGTG CTCCGTGCAT CGGCTACTTG GACGTGCGTG 60

GCAGGCAGAG CAGGCCGGAA AGGAGCAGGT TGGGAGTGTG GTGGGGCCCG CAGCTTCAGC 120

AGTGCCGCGG TGACTATGGC CCCGATCAAG GTGGGAGACA CCATTCCCTC AGTGGAGGTA 180

TTTGRAGGGG AACCTGGAAA GAAGGTGAAC TTGGCAGAGC TGTTCAAGGA CAAGAAAGGT 240

GTTTTGTTTG GAGTCCCTGG GGCATTTACA CCTGGCTGTT CCAAGACCCA TCTGCCTGGG 300

TTTGTGGAGC AAGCCGGAGC TCYGAAGGCC AAGGGAGCAC AAGTGGTGGC CTGTCTGAGT 360

GTTAATGATG YCTTCGTGAC TGCAGAGTGG GGTCGAGCCC ACCAGGCAGA AGGCAAGGTT 420

CAGCTCCTGG CTGACCCCAC TGGAGCTTTT GGAAAGGAGA CAGATTTACT ACTAGATGAT 480

TCTTTGGTGT CTCTCTTTGG GAATCGTCGG CTAAAAAGGT TCTCCATGGT GATAGACAAG 540

GGCGTAGTAA AGGCACTGAA CGTGGAGCCG GATGGCACAG GCCTCACCTG CAGCCTGGCC 600

CCCAACATCC TCTCACAACT CTGAGGCCCT GACCAGAATG TCCTCTGACT CTCCCATCTC 660

CTCCACCCAG CTCTGGGCCA AAGGCCCAGT ACCTCCTTAC CTGAGGGCCA CTGGAATGGA 720

ACCTTGACAA TATTTCTGCA ATAAACAGTT TAATTTGTGA AAAAAAAAAA AAAAAAAAAA 780



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus Rattus 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 17 

(D) OTHER INFORMATION: /product= "Glu or Gly" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 63 

(D) OTHER INFORMATION: /product= "Leu or Pro" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 79

(D) OTHER INFORMATION: /product= "Ala or Val"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ala Pro Ile Lys Val Gly Asp Thr Ile Pro Ser Val Glu Val Phe
1 5 10 15

Xaa Gly Glu Pro Gly Lys Lys Val Asn Leu Ala Glu Leu Phe Lys Asp
20 25 30 

Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe Thr Pro Gly Cys 
35 40 45 

Ser Lys Thr His Leu Pro Gly Phe Val Glu Gln Ala Gly Ala Xaa Lys
50 55 60 

Ala Lys Gly Ala Gln Val Val Ala Cys Leu Ser Val Asn Asp Xaa Phe
65 70 75 80 

Val Thr Ala Glu Trp Gly Arg Ala His Gln Ala Glu Gly Lys Val Gln
85 90 95 

Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Glu Thr Asp Leu Leu 
100 105 110 

Leu Asp Asp Ser Leu Val Ser Leu Phe Gly Asn Arg Arg Leu Lys Arg 
115 120 125 

Phe Ser Met Val Ile Asp Lys Gly Val Val Lys Ala Leu Asn Val Glu
130 135 140 

Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro Asn Ile Leu Ser
145 150 155 160 

Gln Leu

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 99..588

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TGCTCCGTGC ATCGACGTGC TTGGCAGGCA GAGCAGGCCG GAAAGAAGCA GGTTGGGAGT 60 

GTGGCGGAGC CCGCAGCTTC AGCAGCTCCG CGGTGACCAT GGCCCCGATC AAGGTGGGAG 120 

ATGCCATTCC CTCAGTGGAG GTATTTGAAG GGGAACCGGG AAAGAAGGTG AACTTGGCAG 180 

AGCTGTTCAA GGGCAAGAAA GGTGTTTTGT TTGGAGTCCC TGGGGCATTT ACACCTGGCT 240

GTTCTAAGAC CCACCTGCCT GGGTTTGTGG AGCAAGCTGG AGCTCTGAAG GCTAAGGGAG 300

CGCAGGTGGT GGCCTGTCTG AGCGTTAATG ACGTCTTTGT GATTGAAGAG TGGGGTCGAG 360 

CCCACCAGGC AGAAGGCAAG GTTCGGCTCC TGGCTGACCC CACTGGAGCC TTTGGGAAGG 420 

CGACAGACTT ATTATTGGAT GATTCTTTGG TGTCTCTCTT TGGGAATCGT CGGCTGAAAA 480 

GGTTCTCCAT GGTGATAGAG AACGGCATAG TGAAGGCACT GAACGTGGAG CCAGATGGCA 540

CAGGCCTCAC CTGCAGCCTG GCCCCCAACA TCCTCTCCCA ACTCTGAGGC CCTGGCCAGA 600 

TGTCCTCTGA CTCTCCCATC TCTCCCACCC GGCTCTAGGC CAAAAGGCTC GGTACCTCCT 660 

TACTGGGAGC CACGT 675 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Pro Ile Lys Val Gly Asp Ala Ile Pro Ser Val Glu Val Phe
1 5 10 15

Glu Gly Glu Pro Gly Lys Lys Val Asn Leu Ala Glu Leu Phe Lys Gly 
20 25 30 

Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe Thr Pro Gly Cys 
35 40 45 

Ser Lys Thr His Leu Pro Gly Phe Val Glu Gln Ala Gly Ala Leu Lys
50 55 60 

Ala Lys Gly Ala Gln Val Val Ala Cys Leu Ser Val Asn Asp Val Phe
65 70 75 80 

Val Ile Glu Glu Trp Gly Arg Ala His Gln Ala Glu Gly Lys Val Arg
85 90 95 

Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Ala Thr Asp Leu Leu 
100 105 110 

Leu Asp Asp Ser Leu Val Ser Leu Phe Gly Asn Arg Arg Leu Lys Arg
115 120 125

Phe Ser Met Val Ile Asp Asn Gly Ile Val Lys Ala Leu Asn Val Glu
130 135 140

Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro Asn Ile Leu Ser
145 150 155 160

Gln Leu



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 161..382



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GGGTATGGGA CTAGCTGGCG TGTGCGCCCT GAGACGCTCA GCGGGCTATA TACTCGTCGG 60 

TGGGGCCGGC GGTCAGTCTG CGGCAGCGGC AGCAAGACGG TGCAGTGAAG GAGAGTGGGC 120 

GTCTGGCGGG GTCCGCAGTT TCAGCAGAGC CGCTGCAGCC ATGGCCCCAA TCAAGGTTCG 180

GCTCCTGGCT GATCCCACTG GGGCCTTTGG GAAGGAGACA GACTTATTAC TAGATGATTC 240

GCTGGTGTCC ATCTTTGGGA ATCGACGTCT CAAGAGGTTC TCCATGGTGG TACAGGATGG 300 

CATAGTGAAG GCCCTGAATG TGGAACCAGA TGGCACAGGC CTCACCTGCA GCCTGGCACC 360

CAATATCATC TCACAGCTCT GAGGCCCTGG GCCAGATTAC TTCCTCCACC CCTCCCTATC 420 

TCACCTGCCC AGCCGTGTGC TGGGGCCCTG CAATTGGAAT GTTGGCCAG 469
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 161..514

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GGGTATGGGA CTAGCTGGCG TGTGCGCCCT GAGACGCTCA GCGGGCTATA TACTCGTCGG 60 

TGGGGCCGGC GGTCAGTCTG CGGCAGCGGC AGCAAGACGG TGCAGTGAAG GAGAGTGGGC 120 

GTCTGGCGGG GTCCGCAGTT TCAGCAGAGC CGCTGCAGCC ATGGCCCCAA TCAAGACACA 180 

CCTGCCAGGG TTTGTGGAGC AGGCTGAGGC TCTGAAGGCC AAGGGAGTCC AGGTGGTGGC 240

CTGTCTGAGT GTTAATGATG CCTTTGTGAC TGGCGAGTGG GGCCGAGCCC ACAAGGCGGA 300 

AGGCAAGGTT CGGCTCCTGG CTGATCCCAC TGGGGCCTTT GGGAAGGAGA CAGACTTATT 360 

ACTAGATGAT TCGCTGGTGT CCATCTTTGG GAATCGACGT CTCAAGAGGT TCTCCATGGT 420

GGTACAGGAT GGCATAGTGA AGGCCCTGAA TGTGGAACCA GATGGCACAG GCCTCACCTG 480 

CAGCCTGGCA CCCAATATCA TCTCACAGCT CTGAGGCCCT GGGCCAGATT ACTTCCTCCA 540 

CCCCTCCCTA TCTCACCTGC CCAGCCCTGT GCTGGGGCCC TGCAATTGGA ATGTTGGCCA 600 

G 601 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 161..517



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGGTATGGGA CTAGCTGGCG TGTGCGCCCT GAGACGCTCA GCGGGCTATA TACTCGTCGG 60

TGGGGCCGGC GGTCAGTCTG CGGCAGCGGC AGCAAGACGG TGCAGTGAAG GAGAGTGGGC 120

GTCTGGCGGG GTCCGCAGTT TCAGCAGAGC CGCTGCAGCC ATGGCCCCAA TCAAGGTGGG 180 

AGATGCCATC CCAGCAGTGG AGGTGTTTGA AGGGGAGCCA GGGAACAAGG TGAACCTGGC 240

AGAGCTGTTC AAGGGCAAGA AGGGTGTGCT GTTTGGAGTT CCTGGGGCCT TCACCCCTGG 300 

ATGTTCCAAG GTTCGGCTCC TGGCTGATCC CACTGGGGCC TTTGGGAAGG AGACAGACTT 360 

ATTACTAGAT GATTCGCTGG TGTCCATCTT TGGGAATCGA CGTCTCAAGA GGTTCTCCAT 420

GGTGGTACAG GATGGCATAG TGAAGGCCCT GAATGTGGAA CCAGATGGCA CAGGCCTCAC 480

CTGCAGCCTG GCACCCAATA TCATCTCACA GCTCTGAGGC CCTGGGCCAG ATTACTTCCT 540

CCACCCCTCC CTATCTCACC TGCCCAGCCC TGTGCTGGGG CCCTGCAATT GGAATGTTGG 600 

CCAG 604 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2710 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: exon

(B) LOCATION: 2516..2710

(ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 2074..2135

(ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 1932..1970

(ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 1728..1859

(ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 802..936



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

TCTGTCCCTT AGCGCCCCCG CGGGGGCTTA CCCCATCCCA CTCCATGACC TCCCCTCCCC 60

CCATGGCGAA TTCCCACCTT TCTGTCTTTC ACTCACTTCC TGGAACCGTC CCCAGGGCCT 120 

TGGACCTTCC CCCTTCTCCT CCCAAACCTT GTGAGACCCC ATTCCCTTTC TACTTCATCC 180

TGCTCTCAAC TTTTGGGCTC CTCAGAGGCC CTCACCCCTG ACTCTCTCTC CCTACCCACT 240 

CTGGTCCCAT GAAGCCCTCA AGTACTCTGG GGATGGATCC TTCCCCCTTC AAAAGATTCC 300 

TTCTTTTGTT CTACACCTCC TGGGTGTAGG GGCCTGGACA CCCTCCCCCA ACGTTCCACC 360 

TGCCGCTGCC CTTCCTCTTC CTCCTCCTGA GGGTGGGACC CTCAGACCTG GCCAAGATCC 420 

TCTCCCTCCA TGTTGTCAGG GACTCCTCCT CACCCCCAAA TACAGCCCTC TAGCCCCTGT 480

CCATTTTATT CCACTCCTTT CCTGTAACCT AGACAGCATG TTATGCAACC CTTTGCGACA 540

CATGGGGAAA CCTTCCCTCC CTTCCTCTGT TGTCACCAAT GGCCCCTTAA GAGGAGCAGG 600 

GCCACCTTGA AACTTGGAGG ATATGGGGTA ACCCAGTGGG AGCGGGCAGG GAGGGCCCTT 660 

GGAAACTGAC AGGGCTGGAG TATCCTGCTG GGTTTCAGCC CCGGTTCCTG CAGGCACAGC 720 

TGCCAGGCTC TCTGTTCACC TTCCTGCCTC TGGTTTGCCC CGGCTCCCTC ACCCCCCTTA 780

CCCTGGAGTC CTTCCTTCTA GGTGGGAGAT GCCATCCCAG CAGTGGAGGT GTTTGAAGGG 840

GAGCCAGGGA ACAAGGTGAA CCTGGCAGAG CTGTTCAAGG GCAAGAAGGG TGTGCTGTTT 900

GGAGTTCCTG GGGCCTTCAC CCCTGGATGT TCCAAGGTGA GGCCCTTCCC CTTCTGAAGA 960 

TCAGGACCTG GGGATCTTTT GTGTTGCTCT TAAGTCCTCC ACATAGTCCT GATAGGACTC 1020

CTAAAAAGCA TTTCAGTGCC ATCACAAAAC AAGTAGAGCT GGGTAGAGCT GGGCGCGGTG 1080 

GCTCACGCCT GTAATCCCAG CACTTTGGGA GGCCAAGGCG GGTGGATCAC GAGGTCAGGA 1140

GTCCAAAACC AGCCTGGCCA AGATGGTGAA ACCCTGTCTC TACTAAAAAT GCAAAAAAAT 1200

CAGCCGGATA TGGTGGCGGG CGCCTGTAAT CCCAGGTATT GGGGAGGCTG AGGCAGAGAA 1260 

TTGCTTGAAC CCAGGAGGCG TAGGTTGCAG TGAGTGGAGA TCGTGCCTCT GCAGTCCAGC 1320 

CTGGGTGAAA GAGCGAGACT CCGTCTCAAA ATGAAAAAAA AAAAAGAAAA CAAGTAGAGA 1380

CTGCAAAAAG GGAACAGTAC CGGGAATGTT GGAGAAAAAC ATACTACAAT TAAATCCAAC 1440 

ACCCCTGTTG GTCCTGCTAA ATGACAGGCA CTGTGGAAGG TGCTTGGGAC TCAGATAAAT 1500 

AAGACAAAGA TCTGCCCATG GAAAGTTCAC GTCTGGACCA TAAGGCATTA GGTTTCATTC 1560

TGAGCTTCCT AGTGGCCAAG GCAAAAAGGA AATAGAATGG TTTAGACAGC TCTCATTGTC 1620 

TGATCAAAGG TGTTGAGGCA GAGCACTGAG GAGGGCCTGG AGATAAAGGG TGGGCTGGGG 1680

GTCAGATGCA GTTATCCCTT TGCCGACCCT TTGTTCCCCT TCCTCAGACA CACCTGCCAG 1740

GGTTTGTGGA GCAGGCTGAG GCTCTGAAGG CCAAGGGAGT CCAGGTGGTG GCCTGTCTGA 1800 

GTGTTAATGA TGCCTTTGTG ACTGGCGAGT GGGGCCGAGC CCACAAGGCG GAAGGCAAGG 1860

TGAGGTGAGG GGCCTGCAGG GAGTCAGGAC CAGGTAGGAT ATTCTTCTTG TGACCTCTAC 1920

TTTCTCTGCA GGTTCGGCTC CTGGCTGATC CCACTGGGGC CTTTGGGAAG GTGAGTGTTC 1980 

CCCTGACCGC CACAGGGACA TGGCGGTGCG GGGAGCAGTG GGGGCCCTTG GCCTCTTCAA 2040 

GGATTTCTGA CACTTTTCTC TGTCTCTTCT TAGGAGACAG ACTTATTACT AGATGATTCG 2100

CTGGTGTCCA TCTTTGGGAA TCGACGTCTC AAGAGGTAAA AGTGGAGAGT CCTCTGTGGA 2160 

GAAAGTCCTC TGTGGGAGAG AGTCCTCTGT GGGAGAGAGT CCTCTGTGGA GAGGGTCCTC 2220 

TGTGGGAAGA GTCGTCTGTG GGGGAGATGT GTGGGAGAGA GTCCTGTGTG GGGAGAGTCT 2280

TCTGTAGGGG AGAGTCCTCT GGGGAGAGAG TCCTGTGTGG GGGAGAGTCC TCTGTGGGGA 2340 

GAGTCCTCTG TGTGGAGAGA GTCCTGTGTG GTGGTGAGTC CTCTGTGGGG GAGAGTCCTC 2400 

TGTGGGGGGA GTCCTCTCTG GAGTTCTCTT GGGCCCCTGG CTGTTCACTG CCTGTCTCCA 2460

TGCCCAGCCT CCAAGCCCAG GCTGATGCAG CTGGCTGGGC CCCTCTTTCC GGCAGGTTCT 2520

CCATGGTGGT ACAGGATGGC ATAGTGAAGG CCCTGAATGT GGAACCAGAT GGCACAGGCC 2580

TCACCTGCAG CCTGGCACCC AATATCATCT CACAGCTCTG AGGCCCTGGG CCAGATTACT 2640

TCCTCCACCC CTCCCTATCT CACCTGCCCA GCCCTGTGCT GGGGCCCTGC AATTGGAATG 2700 

TTGGCCAGAT 2710 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCCATCCCAG CAGTGGAGGT GTTTG 25 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
TTGAACAGCT CTGCCAGGTT CACC 24 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TGGAGGTGTT TGAAGGGGAG CCAG 24 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CAGGTTCACC TTGTTCCCTG GCTC 24

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGGTATGGGA CTAGCTGGCG 20
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
CTGGCCAACA TTCCAATTGC AG 22



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATGTTATGCA ACCCTTTGCG ACAC 24
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GTGTTTGAAG GGGAGCCAGG GAAC 24 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AGAGACAGGG TTTCACCATC TTGG 24
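
The CDS feature locations given in this listing can be sanity-checked with simple arithmetic: a coding span that includes its stop codon should have a length divisible by three. The following is a minimal sketch, not part of the filing. The function names `span_length` and `check_cds` are hypothetical, and the location strings are copied as printed above. A span that fails the check may point to an extraction error or to a typo in the source rather than to a real feature.

```python
import re

# CDS feature locations as printed in this listing (SEQ ID NO -> LOCATION).
CDS_LOCATIONS = {
    1: "193..681",
    3: "136..624",
    5: "99..588",
    7: "161..382",
    8: "161..514",
    9: "161..517",
}


def span_length(location: str) -> int:
    """Return the inclusive length of a 'start..end' feature location.

    Stray spaces around the two dots (common in scanned listings) are tolerated.
    """
    match = re.fullmatch(r"\s*(\d+)\s*\.\s*\.\s*(\d+)\s*", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError(f"end before start in location: {location!r}")
    return end - start + 1


def check_cds(locations: dict) -> dict:
    """Map each SEQ ID NO to (span length, whether length is a multiple of 3)."""
    report = {}
    for seq_id, location in locations.items():
        length = span_length(location)
        report[seq_id] = (length, length % 3 == 0)
    return report


if __name__ == "__main__":
    for seq_id, (length, in_frame) in check_cds(CDS_LOCATIONS).items():
        status = "ok" if in_frame else "not a multiple of 3"
        print(f"SEQ ID NO: {seq_id}: {length} nt ({status})")
```

For SEQ ID NO: 1 the span 193..681 is 489 nucleotides, that is 163 codons: 162 amino acids plus the stop codon.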



